1 Introduction

Judea Pearl claimed that the Dempster-Shafer Theory (DST) of evidence fails to provide a reasonable solution for the
combination of evidence even for apparently very simple fusion problems [12, 13]. Most criticisms were answered by
Philippe Smets in [23, 24]. The Tweety Penguin Triangle Problem (TP2) is a typical and challenging problem for all
theories managing uncertainty and conflict, because it shows how difficult it is for automatic reasoning systems to
maintain truth when the classical property of transitivity (which is basic to material implication) does not hold. In
his book [12], Judea Pearl presents and discusses in detail the semantic clash between Bayesian and Dempster-Shafer
reasoning. We present here our new analysis of this problem and provide a solution of the Tweety Penguin Triangle
Problem based on our new theory of plausible and paradoxical reasoning, known as DSmT (Dezert-Smarandache Theory). We
show how this problem can be attacked and solved by our new reasoning with the help of the (hybrid) DSm rule of
combination.

The purpose of this paper is not to survey all approaches available in the literature for attacking the TP2 problem but
only to compare DSm reasoning with Bayesian reasoning and with the plausible reasoning of the DST framework. Interesting
but complex analyses of this problem based on default reasoning and $\epsilon$-belief functions can also be found in
the literature. Other interesting and promising approaches to the TP2 problem, based on the fuzzy logic of Zadeh [26]
jointly with the theory of possibilities [6, 7], are under investigation. Some theoretical research on new conditional
event algebras (CEA) has emerged in the literature in recent years and could offer a new track for attacking the TP2
problem, although unfortunately no clear, didactic, simple and convincing examples are provided to show the real
efficiency and usefulness of these theoretical investigations.

2 The Tweety Penguin Triangle Problem 

This very important and challenging problem, known as the Tweety Penguin Triangle Problem (TP2) in the literature, is
presented in detail by Judea Pearl in [12]. We briefly present here the TP2 and the solutions based first on fallacious
Bayesian reasoning and then on Dempster-Shafer reasoning. We then focus our analysis of this problem on the DSmT
framework and DSm reasoning.

Let's consider the set $R = \{r_1, r_2, r_3\}$ of given rules:

• $r_1$: "Penguins normally don't fly" $\Leftrightarrow (p \rightarrow \neg f)$

• $r_2$: "Birds normally fly" $\Leftrightarrow (b \rightarrow f)$

• $r_3$: "Penguins are birds" $\Leftrightarrow (p \rightarrow b)$

To emphasize our strong conviction in these rules, we assign them high confidence weights $w_1$, $w_2$ and $w_3$ in
$[0, 1]$ with $w_1 = 1 - \epsilon_1$, $w_2 = 1 - \epsilon_2$ and $w_3 = 1$ (where $\epsilon_1$ and $\epsilon_2$ are
small positive quantities). The conviction in these rules is represented by the set $W = \{w_1, w_2, w_3\}$ in the sequel.

Another useful and general notation adopted by Judea Pearl in the first pages of his book [12] to characterize these
three weighted rules is the following one (where $w_1, w_2, w_3 \in [0, 1]$):

$$r_1: p \xrightarrow{w_1} (\neg f), \qquad r_2: b \xrightarrow{w_2} f, \qquad r_3: p \xrightarrow{w_3} b$$

When $w_1, w_2, w_3 \in \{0, 1\}$, classical logic is the perfect tool to conclude on the truth or falsity of a
proposition built from these rules, based on the standard propositional calculus and mainly on its three fundamental
rules (Modus Ponens, Modus Tollens and Modus Barbara, i.e. the transitivity rule). When $0 < w_1, w_2, w_3 < 1$,
classical logic can't be applied because Modus Ponens, Modus Tollens and Modus Barbara no longer hold, and some other
tools must be chosen. This will be discussed in detail in section 3.2.

Question: Assume we observe an animal called Tweety ($T$) that is categorically classified as a bird ($b$) and a penguin
($p$), i.e. our observation is $O = [T = (b \cap p)] = [(T = b) \cap (T = p)]$. The notation $T = (b \cap p)$ stands here
for "Entity $T$ holds property $(b \cap p)$". What is the belief (or the probability, if such a probability exists) that
Tweety can fly given the observation $O$ and all information available in our knowledge base (i.e. our rule-based system
$R$ and $W$)?

The difficulty of this problem for most artificial reasoning systems (ARS) comes from the fact that, in this example,
the property of transitivity, usually assumed to hold under the material-implication interpretation,
$(p \rightarrow b, b \rightarrow f) \Rightarrow (p \rightarrow f)$, does not hold here (see section 3.2). In this
interesting example, the classical property of inheritance is thus broken. Nevertheless, a powerful artificial
reasoning system must be able to deal with this kind of difficult problem and must provide a reliable conclusion by a
general mechanism of reasoning, whatever the values of the convictions are (not only values close to either 0 or 1). We
now examine three ARS, based respectively on Bayesian reasoning [12] (which turns out to be fallacious and actually not
appropriate for this problem, and we explain why), on the Dempster-Shafer Theory (DST) [17] and on the
Dezert-Smarandache Theory (DSmT) [21].

3 The fallacious Bayesian reasoning 

We first present the fallacious Bayesian reasoning solution drawn from J. Pearl's book [12] (pages 447-449) and then
explain why this solution, which at first glance seems to agree with intuition, is really fallacious. We then explain
why the common rational intuition actually turns out to be wrong.

3.1 Pearl's analysis

To preserve mathematical rigor, we introduce explicitly all information available in the derivations. In other words,
one wants to evaluate, using Bayesian reasoning, the conditional probability, if it exists,
$P(T = f|O, R, W) = P(T = f|T = p, T = b, R, W)$. Pearl's analysis is based on the assumption that a conviction in a
given rule can be interpreted as a conditional probability (see [12], page 4). In other words, if one has a given rule
$a \xrightarrow{w} b$ with $w \in [0, 1]$, then one can interpret, at least for the calculus, $w$ as $P(b|a)$, and thus
probability theory and Bayesian reasoning can help to answer the question. We prove in the following section that such a
model cannot be reasonably adopted. For now, we just assume that such a probabilistic model holds, as Judea Pearl does.
Based on this assumption, since the conditioning information $(T = p, T = b, R, W)$ is strictly equivalent to
$(T = p, R, W)$ because rule $r_3$ is known with certainty (since $w_3 = 1$), one easily gets Pearl's intuitively
expected but fallacious result:

$$P(T = f|O, R, W) = P(T = f|T = p, T = b, R, W)$$

$$P(T = f|O, R, W) = P(T = f|T = p, R, W)$$

$$P(T = f|O, R, W) = 1 - P(T = \neg f|T = p, R, W)$$

$$P(T = f|O, R, W) = 1 - w_1 = \epsilon_1$$

From this simple analysis, Tweety's "birdness" does not render her a better flyer than an ordinary penguin, as
intuitively expected, and the probability that Tweety can fly remains very low, which looks normal. We reemphasize here
the fact that, in his Bayesian reasoning, J. Pearl assumes that the weight $w_1$ for the conviction in rule $r_1$ can be
interpreted in terms of a real probability measure $P(\neg f|p)$. This assumption is necessary to provide the rigorous
derivation of $P(T = f|O, R, W)$. It turns out however that convictions $w_i$ in logical rules cannot be interpreted in
terms of probabilities, as we will prove in the next section.



When rule $r_3$ is not asserted with absolute certainty ($w_3 = 1$) but is subject to exceptions, i.e.
$w_3 = 1 - \epsilon_3 < 1$, the fallacious Bayesian reasoning yields (where the notations $T = f$, $T = b$ and $T = p$
are replaced by $f$, $b$ and $p$ for notational convenience):

$$P(f|O, R, W) = P(f|p, b, R, W)$$

$$P(f|O, R, W) = \frac{P(f, p, b|R, W)}{P(p, b|R, W)}$$

$$P(f|O, R, W) = \frac{P(f, b|p, R, W) P(p|R, W)}{P(b|p, R, W) P(p|R, W)}$$

By assuming $P(p|R, W) > 0$, one gets after simplification by $P(p|R, W)$

$$P(f|O, R, W) = \frac{P(f, b|p, R, W)}{P(b|p, R, W)}$$

$$P(f|O, R, W) = \frac{P(b|f, p, R, W) P(f|p, R, W)}{P(b|p, R, W)}$$

If one assumes $P(b|p, R, W) = w_3 = 1 - \epsilon_3$ and $P(f|p, R, W) = 1 - P(\neg f|p, R, W) = 1 - w_1 = \epsilon_1$,
one gets

$$P(f|O, R, W) = P(b|f, p, R, W) \times \frac{\epsilon_1}{1 - \epsilon_3}$$

Because $0 \le P(b|f, p, R, W) \le 1$, one finally gets Pearl's result [12] (p. 448)

$$P(f|O, R, W) \le \frac{\epsilon_1}{1 - \epsilon_3} \qquad (1)$$

which states that the observed animal Tweety (a penguin-bird) has a very small probability of flying as long as
$\epsilon_3$ remains small, regardless of how many birds cannot fly ($\epsilon_2$), and consequently has a high
probability of not flying because $P(f|O, R, W) + P(\bar{f}|O, R, W) = 1$, since the events $f$ and $\bar{f}$ are
mutually exclusive and exhaustive (assuming that Pearl's probabilistic model holds).
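
As a numerical illustration of bound (1), the following minimal Python sketch evaluates $\epsilon_1 / (1 - \epsilon_3)$. The function name and the sample values are illustrative assumptions, not part of the original analysis; the point is only that $\epsilon_2$ does not appear anywhere in the bound.

```python
def pearl_upper_bound(eps1: float, eps3: float) -> float:
    """Upper bound (1) on P(f | O, R, W): eps1 / (1 - eps3).

    eps1 is the doubt in rule r1 (penguins normally don't fly) and
    eps3 is the doubt in rule r3 (penguins are birds).
    """
    if not (0.0 <= eps1 <= 1.0):
        raise ValueError("eps1 must lie in [0, 1]")
    if not (0.0 <= eps3 < 1.0):
        raise ValueError("eps3 must lie in [0, 1)")
    return eps1 / (1.0 - eps3)


if __name__ == "__main__":
    # Hypothetical values chosen only for illustration.
    for eps3 in (0.0, 0.05, 0.5):
        print(f"eps1=0.01, eps3={eps3}: bound = {pearl_upper_bound(0.01, eps3):.4f}")
```

With $\epsilon_3 = 0$ the bound reduces to $\epsilon_1$, which is the result obtained above when $r_3$ is certain.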



3.2 The weakness of Pearl's analysis

We now prove that the previous Bayesian reasoning is really fallacious and that, if a deep analysis is done, the problem
of concluding whether Tweety can fly or not is truly undecidable. Actually, Bayes' inference is not a classical
inference. Indeed, before blindly applying Bayesian reasoning as in the previous section, one first has to check
that the probabilistic model is well-founded to characterize the convictions of the rules of the rule-based system under
analysis. We prove here that such a probabilistic model doesn't hold for a suitable and useful representation of the
problem, and consequently for any problem based on the weighting of logical rules (with positive weighting
factors/convictions below 1).



3.2.1 Preliminaries 

We recall here only a few important principles of the propositional calculus of classical Mathematical Logic which
will be used in our demonstration. A simple notation, which may appear unusual to logicians, is adopted here just for
convenience. A detailed presentation of the propositional calculus and Mathematical Logic can easily be found in many
standard mathematical textbooks. Here are these important principles:

• Excluded middle principle: A logical variable is either true or false, i.e.

$$a \vee \neg a \qquad (2)$$

• Non-contradiction law: A logical variable can't be both true and false, i.e.

$$\neg (a \wedge \neg a) \qquad (3)$$

• Modus Ponens: This rule of the propositional calculus states that if a logical variable $a$ is true and
$a \rightarrow b$ is true, then $b$ is true (syllogism principle), i.e.

$$(a \wedge (a \rightarrow b)) \rightarrow b \qquad (4)$$

• Modus Tollens: This rule of the propositional calculus states that if a logical variable $\neg b$ is true and
$a \rightarrow b$ is true, then $\neg a$ is true, i.e.

$$(\neg b \wedge (a \rightarrow b)) \rightarrow \neg a \qquad (5)$$

• Modus Barbara: This rule of the propositional calculus states that if $a \rightarrow b$ is true and $b \rightarrow c$
is true then $a \rightarrow c$ is true (transitivity property), i.e.

$$((a \rightarrow b) \wedge (b \rightarrow c)) \rightarrow (a \rightarrow c) \qquad (6)$$

From these principles, one can easily prove, based on the truth table method, the following property (more general
deducibility theorems in Mathematical Logic can be found in [19, 20]):

$$((a \rightarrow b) \wedge (c \rightarrow d)) \rightarrow ((a \wedge c) \rightarrow (b \wedge d)) \qquad (7)$$
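
The truth table method mentioned above can be sketched in a few lines of Python. This is only an illustrative check, with helper names (`implies`, `is_tautology`) chosen here for convenience; it enumerates all truth assignments and confirms that Modus Ponens, Modus Tollens, Modus Barbara and the property relating two implications to the implication between their conjunctions are tautologies.

```python
from itertools import product


def implies(x: bool, y: bool) -> bool:
    """Material implication x -> y."""
    return (not x) or y


def is_tautology(formula, n_vars: int) -> bool:
    """True if `formula` holds for every assignment of its n_vars Boolean arguments."""
    return all(formula(*values) for values in product((False, True), repeat=n_vars))


def modus_ponens(a, b):
    return implies(a and implies(a, b), b)


def modus_tollens(a, b):
    return implies((not b) and implies(a, b), not a)


def modus_barbara(a, b, c):
    return implies(implies(a, b) and implies(b, c), implies(a, c))


def conjunction_property(a, b, c, d):
    # ((a -> b) and (c -> d)) -> ((a and c) -> (b and d))
    return implies(implies(a, b) and implies(c, d), implies(a and c, b and d))


if __name__ == "__main__":
    print(is_tautology(modus_ponens, 2))
    print(is_tautology(modus_tollens, 2))
    print(is_tautology(modus_barbara, 3))
    print(is_tautology(conjunction_property, 4))
```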





3.2.2 Analysis of the problem when $\epsilon_1 = \epsilon_2 = \epsilon_3 = 0$

We first examine the TP2 when one has no doubt in the rules of our given rule-based system, i.e.

$$r_1: p \xrightarrow{w_1 = 1 - \epsilon_1 = 1} (\neg f), \qquad r_2: b \xrightarrow{w_2 = 1 - \epsilon_2 = 1} f, \qquad r_3: p \xrightarrow{w_3 = 1 - \epsilon_3 = 1} b$$

From rules $r_1$ and $r_2$ and because of property (7), one concludes that

$$(p \wedge b) \rightarrow (f \wedge \neg f)$$

and using the non-contradiction law (3) with the Modus Tollens (5), one finally gets

$$\neg (f \wedge \neg f) \rightarrow \neg (p \wedge b)$$

which proves that $p \wedge b$ is always false whatever the rule $r_3$ is. Interpreted in terms of probability theory,
the event $T = p \cap b$ corresponds actually and truly to the impossible event $\emptyset$, since $T = f$ and
$T = \bar{f}$ are exclusive and exhaustive events. Under such conditions, the analysis proves the non-existence of the
penguin-bird Tweety.
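
The same conclusion can be checked by brute force. The sketch below (an illustration with hypothetical helper names, treating the three rules as strict material implications) enumerates all truth assignments of $(p, b, f)$ and keeps those compatible with the rules; no surviving assignment has both $p$ and $b$ true, with or without rule $r_3$.

```python
from itertools import product


def implies(x: bool, y: bool) -> bool:
    """Material implication x -> y."""
    return (not x) or y


def models(use_r3: bool = True):
    """Truth assignments (p, b, f) satisfying r1: p -> not f, r2: b -> f
    and, optionally, r3: p -> b, all taken with full certainty."""
    result = []
    for p, b, f in product((False, True), repeat=3):
        ok = implies(p, not f) and implies(b, f)
        if use_r3:
            ok = ok and implies(p, b)
        if ok:
            result.append((p, b, f))
    return result


if __name__ == "__main__":
    for use_r3 in (False, True):
        found = models(use_r3)
        penguin_birds = [m for m in found if m[0] and m[1]]
        print(f"r3 used: {use_r3}, models: {len(found)}, penguin-birds: {len(penguin_birds)}")
```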



If one adopts the notations (see footnote 1) of probability theory, trying to derive $P(T = f|T = p \cap b)$ and
$P(T = \bar{f}|T = p \cap b)$ with Bayesian reasoning is just impossible because, from one of the axioms of probability
theory, one must have $P(\emptyset) = 0$ and, from the conditioning rule, one would get for this problem the
indeterminate expressions:

$$P(T = f|T = p \cap b) = P(T = f|T = \emptyset) = \frac{P(T = f \cap \emptyset)}{P(T = \emptyset)} = \frac{P(T = \emptyset)}{P(T = \emptyset)} = \frac{0}{0} \quad \text{(indeterminate)}$$

and similarly

$$P(T = \bar{f}|T = p \cap b) = P(T = \bar{f}|T = \emptyset) = \frac{P(T = \bar{f} \cap \emptyset)}{P(T = \emptyset)} = \frac{P(T = \emptyset)}{P(T = \emptyset)} = \frac{0}{0} \quad \text{(indeterminate)}$$



3.2.3 Analysis of the problem when $0 < \epsilon_1, \epsilon_2, \epsilon_3 < 1$

Let's now examine the general case in which one allows some small doubt in the rules, characterized by taking
$\epsilon_1 > 0$, $\epsilon_2 > 0$ and $\epsilon_3 > 0$, and examine the consequences for the probabilistic model of
these rules.

Footnote 1: Because probabilities are related to sets, we use here the common set-complement notation $\bar{f}$ instead
of the logical negation notation $\neg f$, and $\cap$ for $\wedge$ and $\cup$ for $\vee$ if necessary.



First note that, because of the excluded middle principle and the assumption of the existence of a probabilistic
model for a weighted rule, one should be able to consider simultaneously both "probabilistic/Bayesian" rules

$$a \xrightarrow{P(b|a) = w} b \qquad \text{and} \qquad a \xrightarrow{P(\bar{b}|a) = 1 - w} \neg b \qquad (8)$$

In terms of classical (objective) probability theory, these weighted rules just indicate that in $100 \times w$ percent
of cases the logical variable $b$ is true if $a$ is true, or equivalently, that in $100 \times w$ percent of cases the
random event $b$ occurs when the random event $a$ occurs. When we don't refer to classical probability theory, the
weighting factors $w$ and $1 - w$ just indicate the level of conviction committed to the validity of the rules. Although
very appealing at first glance, this probabilistic model actually hides a strong weakness, especially when dealing with
several rules, as shown below.



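Let's first prove that from a "probabilized" rule $a \xrightarrow{P(b|a) = w} b$ one cannot rigorously assess the
convictions in its Modus Tollens. In other words, from (8), what can we conclude about

$$\neg b \xrightarrow{P(\bar{a}|\bar{b}) = ?} \neg a \qquad \text{and} \qquad b \xrightarrow{P(\bar{a}|b) = ?} \neg a \qquad (9)$$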



From Bayes' rule of conditioning (which must hold if the probabilistic model holds), one can express
$P(\bar{a}|\bar{b})$ and $P(\bar{a}|b)$ as follows

$$P(\bar{a}|\bar{b}) = 1 - P(a|\bar{b}) = 1 - \frac{P(a \cap \bar{b})}{P(\bar{b})} = 1 - \frac{P(\bar{b}|a) P(a)}{P(\bar{b})}$$

$$P(\bar{a}|b) = 1 - P(a|b) = 1 - \frac{P(a \cap b)}{P(b)} = 1 - \frac{P(b|a) P(a)}{P(b)}$$

or equivalently, by replacing $P(b|a)$ and $P(\bar{b}|a)$ by their values $w$ and $1 - w$, one gets

$$P(\bar{a}|\bar{b}) = 1 - (1 - w) \frac{P(a)}{P(\bar{b})}, \qquad P(\bar{a}|b) = 1 - w \frac{P(a)}{P(b)} \qquad (10)$$

These relationships show that one cannot fully derive $P(\bar{a}|\bar{b})$ and $P(\bar{a}|b)$ in theory because the
prior probabilities $P(a)$ and $P(b)$ are unknown.

A simplistic solution, based on the principle of indifference, is then just to assume without solid justification that
$P(a) = P(\bar{a}) = 1/2$ and $P(b) = P(\bar{b}) = 1/2$. With such an assumption, one gets the estimates
$\hat{P}(\bar{a}|\bar{b}) = w$ and $\hat{P}(\bar{a}|b) = 1 - w$ for $P(\bar{a}|\bar{b})$ and $P(\bar{a}|b)$
respectively, and we can go further in the derivations.
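
The dependence of the Modus Tollens convictions on the unknown priors can be made concrete with a short Python sketch. It assumes the probabilistic model $P(b|a) = w$ holds and evaluates $P(\bar{a}|\bar{b}) = 1 - (1 - w) P(a)/P(\bar{b})$ and $P(\bar{a}|b) = 1 - w P(a)/P(b)$; the function name, the coherence check and the sample priors are illustrative assumptions.

```python
def modus_tollens_probabilities(w: float, p_a: float, p_b: float):
    """Return (P(not a | not b), P(not a | b)) given P(b | a) = w and priors P(a), P(b)."""
    if not (0.0 < p_b < 1.0):
        raise ValueError("P(b) must lie strictly between 0 and 1")
    # Coherence: P(a and b) <= P(b) and P(a and not b) <= P(not b).
    if w * p_a > p_b or (1.0 - w) * p_a > 1.0 - p_b:
        raise ValueError("priors are incompatible with P(b | a) = w")
    not_a_given_not_b = 1.0 - (1.0 - w) * p_a / (1.0 - p_b)
    not_a_given_b = 1.0 - w * p_a / p_b
    return not_a_given_not_b, not_a_given_b


if __name__ == "__main__":
    w = 0.95
    # Principle of indifference: P(a) = P(b) = 1/2 gives back (w, 1 - w).
    print(modus_tollens_probabilities(w, 0.5, 0.5))
    # Other (hypothetical) priors give different values for the same w.
    print(modus_tollens_probabilities(w, 0.2, 0.6))
```

The two calls use the same conviction $w$ but return different values, which is exactly why the convictions in the Modus Tollens cannot be assessed from $w$ alone.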



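Now let's go back to our Tweety Penguin Triangle Problem. Based on the probabilistic model (assumed to hold), one
now starts with both

$$\begin{aligned}
r_1 &: p \xrightarrow{P(\bar{f}|p) = 1 - \epsilon_1} \neg f, & p &\xrightarrow{P(f|p) = \epsilon_1} f \\
r_2 &: b \xrightarrow{P(f|b) = 1 - \epsilon_2} f, & b &\xrightarrow{P(\bar{f}|b) = \epsilon_2} \neg f \\
r_3 &: p \xrightarrow{P(b|p) = 1 - \epsilon_3} b, & p &\xrightarrow{P(\bar{b}|p) = \epsilon_3} \neg b
\end{aligned} \qquad (11)$$

Note that, taking into account our preliminary analysis and accepting the principle of indifference, one also has the
two sets of weighted rules

$$\begin{aligned}
f &\xrightarrow{P(\bar{p}|f) = 1 - \epsilon_1} \neg p, & \neg f &\xrightarrow{P(\bar{p}|\bar{f}) = \epsilon_1} \neg p \\
\neg f &\xrightarrow{P(\bar{b}|\bar{f}) = 1 - \epsilon_2} \neg b, & f &\xrightarrow{P(\bar{b}|f) = \epsilon_2} \neg b \\
\neg b &\xrightarrow{P(\bar{p}|\bar{b}) = 1 - \epsilon_3} \neg p, & b &\xrightarrow{P(\bar{p}|b) = \epsilon_3} \neg p
\end{aligned} \qquad (12)$$

One wants to assess the convictions (assumed to correspond to some conditional probabilities) in the following rules

$$p \wedge b \xrightarrow{P(f|p \cap b) = ?} f \qquad (13)$$

$$p \wedge b \xrightarrow{P(\bar{f}|p \cap b) = ?} \neg f \qquad (14)$$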



The question is to derive rigorously $P(f|p \cap b)$ and $P(\bar{f}|p \cap b)$ from all previous available information.
It turns out that the derivation is impossible without an unjustified extra assumption of conditional independence.
Indeed, $P(f|p \cap b)$ and $P(\bar{f}|p \cap b)$ are given by

$$P(f|p \cap b) = \frac{P(f, p, b)}{P(p, b)} = \frac{P(p, b|f) P(f)}{P(b|p) P(p)}, \qquad P(\bar{f}|p \cap b) = \frac{P(\bar{f}, p, b)}{P(p, b)} = \frac{P(p, b|\bar{f}) P(\bar{f})}{P(b|p) P(p)} \qquad (15)$$

If one assumes, as J. Pearl does, that the conditional independence condition also holds, i.e.
$P(p, b|f) = P(p|f) P(b|f)$ and $P(p, b|\bar{f}) = P(p|\bar{f}) P(b|\bar{f})$, then one gets

$$P(f|p \cap b) = \frac{P(p|f) P(b|f) P(f)}{P(b|p) P(p)}, \qquad P(\bar{f}|p \cap b) = \frac{P(p|\bar{f}) P(b|\bar{f}) P(\bar{f})}{P(b|p) P(p)}$$

By accepting again the principle of indifference, $P(f) = P(\bar{f}) = 1/2$ and $P(p) = P(\bar{p}) = 1/2$, one gets the
following expressions

$$\hat{P}(f|p \cap b) = \frac{P(p|f) P(b|f)}{P(b|p)}, \qquad \hat{P}(\bar{f}|p \cap b) = \frac{P(p|\bar{f}) P(b|\bar{f})}{P(b|p)} \qquad (16)$$

Replacing the probabilities $P(p|f)$, $P(b|f)$, $P(b|p)$, $P(p|\bar{f})$ and $P(b|\bar{f})$ by their values in formula
(16), one finally gets

$$\hat{P}(f|p \cap b) = \frac{\epsilon_1 (1 - \epsilon_2)}{1 - \epsilon_3}, \qquad \hat{P}(\bar{f}|p \cap b) = \frac{(1 - \epsilon_1) \epsilon_2}{1 - \epsilon_3} \qquad (17)$$



Therefore we see that, even if one accepts the principle of indifference together with the conditional independence
assumption, the approximated "probabilities" both remain small and do not correspond to a real probability measure,
since the conditional probabilities of the exclusive elements $f$ and $\bar{f}$ do not add up to one. When $\epsilon_1$,
$\epsilon_2$ and $\epsilon_3$ tend towards 0, one has

$$\hat{P}(f|p \cap b) + \hat{P}(\bar{f}|p \cap b) \approx 0$$

Actually our analysis, based on the principle of indifference, the conditional independence assumption and the model
proposed by Judea Pearl, clearly proves that Bayesian reasoning cannot be applied rigorously to this kind of
weighted rule-based system, because no probabilistic model exists for describing the problem correctly. This conclusion
is actually not surprising taking into account the Lewis’ theorem tu) explicated in details in 0 (chapter 11 ). 
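
The failure of normalization can be seen numerically. The sketch below is an illustration under the same assumptions as the derivation (principle of indifference plus conditional independence); it evaluates the two approximated quantities, $\epsilon_1 (1 - \epsilon_2) / (1 - \epsilon_3)$ for "fly" and $(1 - \epsilon_1) \epsilon_2 / (1 - \epsilon_3)$ for "not fly", which follow from substituting $P(p|f) = \epsilon_1$, $P(b|f) = 1 - \epsilon_2$, $P(p|\bar{f}) = 1 - \epsilon_1$, $P(b|\bar{f}) = \epsilon_2$ and $P(b|p) = 1 - \epsilon_3$. The function name and sample values are hypothetical.

```python
def approximated_probabilities(eps1: float, eps2: float, eps3: float):
    """Approximated 'probabilities' that Tweety flies / does not fly,
    under indifference and conditional independence."""
    if not (0.0 <= eps3 < 1.0):
        raise ValueError("eps3 must lie in [0, 1)")
    p_fly = eps1 * (1.0 - eps2) / (1.0 - eps3)
    p_not_fly = (1.0 - eps1) * eps2 / (1.0 - eps3)
    return p_fly, p_not_fly


if __name__ == "__main__":
    # Hypothetical doubts, all small.
    for eps in (0.1, 0.01, 0.001):
        p_fly, p_not_fly = approximated_probabilities(eps, eps, eps)
        print(f"eps={eps}: fly={p_fly:.6f}, not fly={p_not_fly:.6f}, sum={p_fly + p_not_fly:.6f}")
```

For small doubts the sum stays far below one and shrinks towards zero, so the two values cannot be the probabilities of complementary events.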



Let's now explain the reason for the error in the fallacious reasoning, which looked coherent with common
intuition. The problem arises directly from the fact that the penguin class and the bird class are defined in this
problem only with respect to the "flying" and "not-flying" properties. If one considers only these properties, then no
animal Tweety can be categorically classified as a penguin-bird, because penguin-birdness doesn't hold in reality based
on these exclusive and exhaustive properties (if we consider only the information given within the rules $r_1$, $r_2$
and $r_3$). Actually everybody knows that penguins are classified as birds because the "birdness" property is not
defined with respect to the "flying" or "not-flying" abilities of the animal but by other zoological characteristics
$C$ (birds are oviparous, warm-blooded vertebrates with a beak, feathers, and forelimbs that are wings), and such
information must be properly taken into account in rule-based systems to avoid falling into the trap of such fallacious
reasoning. The intuition (which seems to justify the conclusion of the fallacious reasoning) for TP2 is actually biased
because one already knows that penguins (which are truly classified as birds by some other criteria) do not fly in the
real world, and thus we commit a low conviction (which is definitely not a probability measure, but rather a belief) to
the fact that a penguin-bird can fly. Thus Pearl's analysis proposed in [12] appears to the authors to be unfortunately
incomplete and somewhat fallacious.



4 The Dempster-Shafer reasoning

As pointed out by Judea Pearl in [12], Dempster-Shafer reasoning yields, for this problem, a very counter-intuitive
result: birdness seems to endow Tweety with extra flying power! We present here our analysis of this problem based on
Dempster-Shafer reasoning.

Let's examine in detail the available prior information summarized by the rule $r_1$: "Penguins normally don't fly"
$\Leftrightarrow (p \rightarrow \neg f)$ with the conviction $w_1 = 1 - \epsilon_1$, where $\epsilon_1$ is a small
positive number close to zero. This information, in the DST framework, has to be correctly represented in terms of a
conditional belief $\mathrm{Bel}_1(\bar{f}|p) = 1 - \epsilon_1$ rather than directly by the mass
$m_1(\bar{f} \cap p) = 1 - \epsilon_1$.

Choosing $\mathrm{Bel}_1(\bar{f}|p) = 1 - \epsilon_1$ means that there is a high degree of belief that a penguin-animal
is also a nonflying-animal (whatever kind of animal we are observing). This representation reflects perfectly our prior
knowledge, while the erroneous coarse modeling based on the commitment $m_1(\bar{f} \cap p) = 1 - \epsilon_1$ is unable
to distinguish between rule $r_1$ and another (possibly erroneous) rule like $r_1'$: $(\neg f \rightarrow p)$ having the
same conviction value $w_1$. This correct model allows us to distinguish between $r_1$ and $r_1'$ (even if they have the
same numerical level of conviction) by considering the two different conditional beliefs
$\mathrm{Bel}_1(\bar{f}|p) = 1 - \epsilon_1$ and $\mathrm{Bel}_{1'}(p|\bar{f}) = 1 - \epsilon_1$. The coarse/inadequate
basic belief assignment modeling (if adopted) would, on the contrary, make no distinction between the two rules $r_1$
and $r_1'$, since one would have to take $m_1(\bar{f} \cap p) = m_{1'}(p \cap \bar{f})$, and therefore cannot serve as
the starting model for the analysis.

Similarly, the prior information relative to rules $r_2$: $(b \rightarrow f)$ and $r_3$: $(p \rightarrow b)$ with
convictions $w_2 = 1 - \epsilon_2$ and $w_3 = 1 - \epsilon_3$ has to be modeled by the conditional beliefs
$\mathrm{Bel}_2(f|b) = 1 - \epsilon_2$ and $\mathrm{Bel}_3(b|p) = 1 - \epsilon_3$ respectively.

The first problem we now have to face is the combination of these three pieces of prior information, characterized by
$\mathrm{Bel}_1(\bar{f}|p) = 1 - \epsilon_1$, $\mathrm{Bel}_2(f|b) = 1 - \epsilon_2$ and
$\mathrm{Bel}_3(b|p) = 1 - \epsilon_3$. All the available prior information can actually be viewed as three independent
bodies of evidence $\mathcal{B}_1$, $\mathcal{B}_2$ and $\mathcal{B}_3$ providing separately the partial knowledge
summarized through the values of $\mathrm{Bel}_1(\bar{f}|p)$, $\mathrm{Bel}_2(f|b)$ and $\mathrm{Bel}_3(b|p)$. To achieve
the combination, one needs to define complete basic belief assignments $m_1(.)$, $m_2(.)$ and $m_3(.)$ compatible with
the partial conditional beliefs $\mathrm{Bel}_1(\bar{f}|p) = 1 - \epsilon_1$, $\mathrm{Bel}_2(f|b) = 1 - \epsilon_2$ and
$\mathrm{Bel}_3(b|p) = 1 - \epsilon_3$ without introducing extra knowledge. We don't want to introduce into the
derivations extra information that we don't have in reality. We present in detail the justification for the choice of
the assignment $m_1(.)$. The choices for $m_2(.)$ and $m_3(.)$ follow similarly.

The body of evidence $\mathcal{B}_1$ provides some information only about $f$ and $p$ through the value of
$\mathrm{Bel}_1(\bar{f}|p)$ and without reference to $b$. Therefore the frame of discernment $\Theta_1$ induced by
$\mathcal{B}_1$ and satisfying Shafer's model (i.e. a finite set of exhaustive and exclusive elements) corresponds to

$$\Theta_1 = \{\theta_1 = \bar{f} \cap \bar{p}, \; \theta_2 = f \cap \bar{p}, \; \theta_3 = \bar{f} \cap p, \; \theta_4 = f \cap p\}$$

schematically represented by the partition

$$p = \theta_3 \cup \theta_4, \qquad \bar{p} = \theta_1 \cup \theta_2, \qquad \bar{f} = \theta_1 \cup \theta_3, \qquad f = \theta_2 \cup \theta_4$$

The complete basic assignment $m_1(.)$ we are searching for, defined over the power set $2^{\Theta_1}$ and compatible
with $\mathrm{Bel}_1(\bar{f}|p)$, is actually the result of Dempster's combination of an unknown (for now) basic belief
assignment $m_1'(.)$ with the particular assignment $m_1''(.)$ defined by $m_1''(p = \theta_3 \cup \theta_4) = 1$; in
other words, one has

$$m_1(.) = [m_1' \oplus m_1''](.)$$

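From now on, we introduce explicitly the conditioning term in our notation to avoid confusion, and thus we use
$m_1(.|p) = m_1(.|\theta_3 \cup \theta_4)$ instead of $m_1(.)$. From $m_1''(p = \theta_3 \cup \theta_4) = 1$ and from any
generic unknown basic assignment $m_1'(.)$ defined by its components $m_1'(\emptyset) = 0$, $m_1'(\theta_1)$,
$m_1'(\theta_2)$, $m_1'(\theta_3)$, $m_1'(\theta_4)$, $m_1'(\theta_1 \cup \theta_2)$, $m_1'(\theta_1 \cup \theta_3)$,
$m_1'(\theta_1 \cup \theta_4)$, $m_1'(\theta_2 \cup \theta_3)$, $m_1'(\theta_2 \cup \theta_4)$,
$m_1'(\theta_3 \cup \theta_4)$, $m_1'(\theta_1 \cup \theta_2 \cup \theta_3)$, $m_1'(\theta_1 \cup \theta_2 \cup \theta_4)$,
$m_1'(\theta_1 \cup \theta_3 \cup \theta_4)$, $m_1'(\theta_2 \cup \theta_3 \cup \theta_4)$ and
$m_1'(\theta_1 \cup \theta_2 \cup \theta_3 \cup \theta_4)$, and applying Dempster's rule, one easily gets the following
expressions for $m_1(.|\theta_3 \cup \theta_4)$. All $m_1(.|\theta_3 \cup \theta_4)$ masses are zero except theoretically

$$m_1(\theta_3|\theta_3 \cup \theta_4) = \underbrace{m_1''(\theta_3 \cup \theta_4)}_{1} [m_1'(\theta_3) + m_1'(\theta_1 \cup \theta_3) + m_1'(\theta_2 \cup \theta_3) + m_1'(\theta_1 \cup \theta_2 \cup \theta_3)] / K_1$$

$$m_1(\theta_4|\theta_3 \cup \theta_4) = \underbrace{m_1''(\theta_3 \cup \theta_4)}_{1} [m_1'(\theta_4) + m_1'(\theta_1 \cup \theta_4) + m_1'(\theta_2 \cup \theta_4) + m_1'(\theta_1 \cup \theta_2 \cup \theta_4)] / K_1$$

$$m_1(\theta_3 \cup \theta_4|\theta_3 \cup \theta_4) = \underbrace{m_1''(\theta_3 \cup \theta_4)}_{1} [m_1'(\theta_3 \cup \theta_4) + m_1'(\theta_1 \cup \theta_3 \cup \theta_4) + m_1'(\theta_2 \cup \theta_3 \cup \theta_4) + m_1'(\theta_1 \cup \theta_2 \cup \theta_3 \cup \theta_4)] / K_1$$

with

$$K_1 = 1 - \underbrace{m_1''(\theta_3 \cup \theta_4)}_{1} [m_1'(\theta_1) + m_1'(\theta_2) + m_1'(\theta_1 \cup \theta_2)]$$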

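To complete the derivation of $m_1(.|\theta_3 \cup \theta_4)$, one needs to use the fact that one knows that
$\mathrm{Bel}_1(\bar{f}|p) = 1 - \epsilon_1$ which, by definition, is expressed by

$$\mathrm{Bel}_1(\bar{f}|p) = \mathrm{Bel}_1(\theta_1 \cup \theta_3|\theta_3 \cup \theta_4)$$

$$\mathrm{Bel}_1(\bar{f}|p) = m_1(\theta_1|\theta_3 \cup \theta_4) + m_1(\theta_3|\theta_3 \cup \theta_4) + m_1(\theta_1 \cup \theta_3|\theta_3 \cup \theta_4)$$

$$\mathrm{Bel}_1(\bar{f}|p) = 1 - \epsilon_1$$

But from the generic expression of $m_1(.|\theta_3 \cup \theta_4)$, one also knows that
$m_1(\theta_1|\theta_3 \cup \theta_4) = 0$ and $m_1(\theta_1 \cup \theta_3|\theta_3 \cup \theta_4) = 0$. Thus the
knowledge of $\mathrm{Bel}_1(\bar{f}|p) = 1 - \epsilon_1$ implies

$$m_1(\theta_3|\theta_3 \cup \theta_4) = [m_1'(\theta_3) + m_1'(\theta_1 \cup \theta_3) + m_1'(\theta_2 \cup \theta_3) + m_1'(\theta_1 \cup \theta_2 \cup \theta_3)] / K_1$$

$$m_1(\theta_3|\theta_3 \cup \theta_4) = 1 - \epsilon_1$$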

This is however not sufficient to fully define the values of all components of $m_1(.|\theta_3 \cup \theta_4)$ or,
equivalently, of all components of $m_1'(.)$. To complete the derivation without extra unjustified specific information,
one needs to apply the minimal commitment principle (MCP), which states that one should never give more support to the
truth of a proposition than justified [9]. According to this principle, we commit a non-null value only to the least
specific proposition involved in the expression of $m_1(\theta_3|\theta_3 \cup \theta_4)$. In other words, the MCP allows
us to choose legitimately

$$m_1'(\theta_1) = m_1'(\theta_2) = m_1'(\theta_3) = 0$$

$$m_1'(\theta_1 \cup \theta_2) = m_1'(\theta_1 \cup \theta_3) = m_1'(\theta_2 \cup \theta_3) = 0$$

$$m_1'(\theta_1 \cup \theta_2 \cup \theta_3) \neq 0$$

Thus $K_1 = 1$ and $m_1(\theta_3|\theta_3 \cup \theta_4)$ reduces to

$$m_1(\theta_3|\theta_3 \cup \theta_4) = m_1'(\theta_1 \cup \theta_2 \cup \theta_3) = 1 - \epsilon_1$$

Since the sum of basic belief assignments must be one, one must also have for the remaining (uncommitted for now)
masses of $m_1'(.)$ the constraint

$$m_1'(\theta_4) + m_1'(\theta_1 \cup \theta_4) + m_1'(\theta_2 \cup \theta_4) + m_1'(\theta_1 \cup \theta_2 \cup \theta_4) + m_1'(\theta_3 \cup \theta_4) + m_1'(\theta_1 \cup \theta_3 \cup \theta_4) + m_1'(\theta_2 \cup \theta_3 \cup \theta_4) + m_1'(\theta_1 \cup \theta_2 \cup \theta_3 \cup \theta_4) = \epsilon_1$$

By applying the MCP a second time, one chooses $m_1'(\theta_1 \cup \theta_2 \cup \theta_3 \cup \theta_4) = \epsilon_1$.

Finally, the complete and least specific belief assignment $m_1(.|p)$ compatible with the available prior information
$\mathrm{Bel}_1(\bar{f}|p) = 1 - \epsilon_1$ provided by the source $\mathcal{B}_1$ reduces to

$$m_1(\theta_3|\theta_3 \cup \theta_4) = m_1'(\theta_1 \cup \theta_2 \cup \theta_3) = 1 - \epsilon_1 \qquad (18)$$

$$m_1(\theta_3 \cup \theta_4|\theta_3 \cup \theta_4) = m_1'(\theta_1 \cup \theta_2 \cup \theta_3 \cup \theta_4) = \epsilon_1 \qquad (19)$$

or equivalently

$$m_1(\bar{f} \cap p|p) = m_1'(\bar{p} \cup \bar{f}) = 1 - \epsilon_1 \qquad (20)$$

$$m_1(p|p) = m_1'(\bar{p} \cup \bar{f} \cup p \cup f) = \epsilon_1 \qquad (21)$$



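It is easy to check, from the mass $m_1(.|p)$, that one effectively gets $\mathrm{Bel}_1(\bar{f}|p) = 1 - \epsilon_1$.
Indeed:

$$\mathrm{Bel}_1(\bar{f}|p) = \mathrm{Bel}_1(\theta_1 \cup \theta_3|p)$$

$$\mathrm{Bel}_1(\bar{f}|p) = \mathrm{Bel}_1((\bar{f} \cap \bar{p}) \cup (\bar{f} \cap p)|p)$$

$$\mathrm{Bel}_1(\bar{f}|p) = \underbrace{m_1(\bar{f} \cap \bar{p}|p)}_{0} + m_1(\bar{f} \cap p|p) + \underbrace{m_1((\bar{f} \cap \bar{p}) \cup (\bar{f} \cap p)|p)}_{0}$$

$$\mathrm{Bel}_1(\bar{f}|p) = m_1(\bar{f} \cap p|p)$$

$$\mathrm{Bel}_1(\bar{f}|p) = 1 - \epsilon_1$$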
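
The check above can also be reproduced numerically. The following Python sketch is an illustration only: it encodes the four atoms of the frame as the strings "t1" to "t4" (a labeling chosen here for convenience, with t1 = not-fly and not-penguin, t2 = fly and not-penguin, t3 = not-fly and penguin, t4 = fly and penguin), starts from the minimally committed assignment with mass $1 - \epsilon_1$ on t1 ∪ t2 ∪ t3 and $\epsilon_1$ on the whole frame, conditions it on $p$ = t3 ∪ t4 by Dempster's rule of conditioning, and evaluates the conditional belief in "not fly".

```python
def condition(mass: dict, given: frozenset) -> dict:
    """Dempster conditioning: intersect each focal set with `given`,
    discard empty intersections and renormalize."""
    conditioned = {}
    for focal, value in mass.items():
        inter = focal & given
        if inter:
            conditioned[inter] = conditioned.get(inter, 0.0) + value
    total = sum(conditioned.values())
    if total == 0.0:
        raise ValueError("conditioning event is totally conflicting with the mass")
    return {focal: value / total for focal, value in conditioned.items()}


def belief(mass: dict, target: frozenset) -> float:
    """Bel(target): sum of the masses of focal sets included in target."""
    return sum(value for focal, value in mass.items() if focal <= target)


eps1 = 0.01  # hypothetical doubt in rule r1
theta = frozenset({"t1", "t2", "t3", "t4"})
p = frozenset({"t3", "t4"})       # penguin
not_f = frozenset({"t1", "t3"})   # does not fly

# Minimally committed prior assignment: (not p) or (not f) gets 1 - eps1.
m1_prime = {frozenset({"t1", "t2", "t3"}): 1.0 - eps1, theta: eps1}
m1_given_p = condition(m1_prime, p)

if __name__ == "__main__":
    print(belief(m1_given_p, not_f))
```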

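In a similar way, for the source $\mathcal{B}_2$ with $\Theta_2$ defined as

$$\Theta_2 = \{\theta_1 = f \cap \bar{b}, \; \theta_2 = \bar{b} \cap \bar{f}, \; \theta_3 = f \cap b, \; \theta_4 = \bar{f} \cap b\}$$

schematically represented by the partition

$$b = \theta_3 \cup \theta_4, \qquad \bar{b} = \theta_1 \cup \theta_2, \qquad f = \theta_1 \cup \theta_3, \qquad \bar{f} = \theta_2 \cup \theta_4$$

one looks for $m_2(.|b) = [m_2' \oplus m_2''](.)$ with $m_2''(b) = m_2''(\theta_3 \cup \theta_4) = 1$. From the MCP, the
condition $\mathrm{Bel}_2(f|b) = 1 - \epsilon_2$ and simple algebraic manipulations, one finally gets

$$m_2(\theta_3|\theta_3 \cup \theta_4) = m_2'(\theta_1 \cup \theta_2 \cup \theta_3) = 1 - \epsilon_2 \qquad (22)$$

$$m_2(\theta_3 \cup \theta_4|\theta_3 \cup \theta_4) = m_2'(\theta_1 \cup \theta_2 \cup \theta_3 \cup \theta_4) = \epsilon_2 \qquad (23)$$

or equivalently

$$m_2(f \cap b|b) = m_2'(\bar{b} \cup f) = 1 - \epsilon_2 \qquad (24)$$

$$m_2(b|b) = m_2'(\bar{b} \cup f \cup b \cup \bar{f}) = \epsilon_2 \qquad (25)$$

In a similar way, for the source $\mathcal{B}_3$ with $\Theta_3$ defined as

$$\Theta_3 = \{\theta_1 = b \cap \bar{p}, \; \theta_2 = \bar{b} \cap \bar{p}, \; \theta_3 = p \cap b, \; \theta_4 = \bar{b} \cap p\}$$

schematically represented by the partition

$$p = \theta_3 \cup \theta_4, \qquad \bar{p} = \theta_1 \cup \theta_2, \qquad b = \theta_1 \cup \theta_3, \qquad \bar{b} = \theta_2 \cup \theta_4$$

one looks for $m_3(.|p) = [m_3' \oplus m_3''](.)$ with $m_3''(p) = m_3''(\theta_3 \cup \theta_4) = 1$. From the MCP, the
condition $\mathrm{Bel}_3(b|p) = 1 - \epsilon_3$ and simple algebraic manipulations, one finally gets

$$m_3(\theta_3|\theta_3 \cup \theta_4) = m_3'(\theta_1 \cup \theta_2 \cup \theta_3) = 1 - \epsilon_3 \qquad (26)$$

$$m_3(\theta_3 \cup \theta_4|\theta_3 \cup \theta_4) = m_3'(\theta_1 \cup \theta_2 \cup \theta_3 \cup \theta_4) = \epsilon_3 \qquad (27)$$

or equivalently

$$m_3(b \cap p|p) = m_3'(\bar{p} \cup b) = 1 - \epsilon_3 \qquad (28)$$

$$m_3(p|p) = m_3'(\bar{b} \cup \bar{p} \cup b \cup p) = \epsilon_3 \qquad (29)$$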



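Since all the complete prior basic belief assignments are available, one can combine them with Dempster's
rule to summarize all our prior knowledge drawn from our simple rule-based expert system characterized by rules
$R = \{r_1, r_2, r_3\}$ and convictions/confidences $W = \{w_1, w_2, w_3\}$ in these rules.

The fusion operation first requires choosing the following frame of discernment $\Theta$ (satisfying Shafer's
model), given by

$$\Theta = \{\theta_1, \theta_2, \theta_3, \theta_4, \theta_5, \theta_6, \theta_7, \theta_8\}$$

where

$$\begin{aligned}
\theta_1 &= f \cap b \cap p, & \theta_5 &= \bar{f} \cap b \cap p \\
\theta_2 &= f \cap b \cap \bar{p}, & \theta_6 &= \bar{f} \cap b \cap \bar{p} \\
\theta_3 &= f \cap \bar{b} \cap p, & \theta_7 &= \bar{f} \cap \bar{b} \cap p \\
\theta_4 &= f \cap \bar{b} \cap \bar{p}, & \theta_8 &= \bar{f} \cap \bar{b} \cap \bar{p}
\end{aligned}$$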



The fusion of the masses $m_1(.)$ given by eqs. (20)-(21) with $m_2(.)$ given by eqs. (24)-(25) using Dempster's rule of
combination yields $m_{12}(.) = [m_1 \oplus m_2](.)$ with the following non-null components

$$m_{12}(f \cap b \cap p) = \epsilon_1 (1 - \epsilon_2) / K_{12}$$

$$m_{12}(\bar{f} \cap b \cap p) = \epsilon_2 (1 - \epsilon_1) / K_{12}$$

$$m_{12}(b \cap p) = \epsilon_1 \epsilon_2 / K_{12}$$

with $K_{12} = 1 - (1 - \epsilon_1)(1 - \epsilon_2) = \epsilon_1 + \epsilon_2 - \epsilon_1 \epsilon_2$.



The fusion of all prior knowledge by Dempster's rule, $m_{123}(.) = [m_1 \oplus m_2 \oplus m_3](.) = [m_{12} \oplus m_3](.)$,
yields the final result:

$$m_{123}(f \cap b \cap p) = m_{123}(\theta_1) = \epsilon_1 (1 - \epsilon_2) / K_{123}$$

$$m_{123}(\bar{f} \cap b \cap p) = m_{123}(\theta_5) = \epsilon_2 (1 - \epsilon_1) / K_{123}$$

$$m_{123}(b \cap p) = m_{123}(\theta_1 \cup \theta_5) = \epsilon_1 \epsilon_2 / K_{123}$$

with $K_{123} = K_{12} = 1 - (1 - \epsilon_1)(1 - \epsilon_2) = \epsilon_1 + \epsilon_2 - \epsilon_1 \epsilon_2$,
which defines actually and precisely the conditional belief assignment $m_{123}(.|p \cap b)$. It turns out that the
fusion with the last basic belief assignment $m_3(.)$ brings no change with respect to the previous fusion result
$m_{12}(.)$ in this particular problem.



Since we are actually interested in assessing the belief that our observed particular penguin-animal named Tweety
(denoted as $T = (p \cap b)$) can fly, we need to combine all our prior knowledge $m_{123}(.)$ drawn from our rule-based
system with the belief assignment $m_o(T = (p \cap b)) = 1$ characterizing the observation about Tweety. Applying
Dempster's rule again, one finally gets the resulting conditional basic belief function
$m_{o123} = [m_o \oplus m_{123}](.)$ defined by

$$m_{o123}(T = (f \cap b \cap p)|T = (p \cap b)) = \epsilon_1 (1 - \epsilon_2) / K_{12}$$

$$m_{o123}(T = (\bar{f} \cap b \cap p)|T = (p \cap b)) = \epsilon_2 (1 - \epsilon_1) / K_{12}$$

$$m_{o123}(T = (b \cap p)|T = (p \cap b)) = \epsilon_1 \epsilon_2 / K_{12}$$

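From Dempster-Shafer reasoning, the belief and plausibility that Tweety can fly are given by

$$\mathrm{Bel}(T = f|T = (p \cap b)) = \sum_{x \in 2^{\Theta}, \, x \subseteq f} m_{o123}(T = x|T = (p \cap b))$$

$$\mathrm{Pl}(T = f|T = (p \cap b)) = \sum_{x \in 2^{\Theta}, \, x \cap f \neq \emptyset} m_{o123}(T = x|T = (p \cap b))$$

Because $f = [(f \cap b \cap p) \cup (f \cap b \cap \bar{p}) \cup (f \cap \bar{b} \cap p) \cup (f \cap \bar{b} \cap \bar{p})]$
and given the specific values of the masses defining $m_{o123}(.)$, one has

$$\mathrm{Bel}(T = f|T = (p \cap b)) = m_{o123}(T = (f \cap b \cap p)|T = (p \cap b))$$

$$\mathrm{Pl}(T = f|T = (p \cap b)) = m_{o123}(T = (f \cap b \cap p)|T = (p \cap b)) + m_{o123}(T = (b \cap p)|T = (p \cap b))$$

and finally

$$\mathrm{Bel}(T = f|T = (p \cap b)) = \frac{\epsilon_1 (1 - \epsilon_2)}{K_{12}} \qquad (30)$$

$$\mathrm{Pl}(T = f|T = (p \cap b)) = \frac{\epsilon_1 (1 - \epsilon_2)}{K_{12}} + \frac{\epsilon_1 \epsilon_2}{K_{12}} = \frac{\epsilon_1}{K_{12}} \qquad (31)$$

In a similar way, one gets for the belief and the plausibility that Tweety cannot fly

$$\mathrm{Bel}(T = \bar{f}|T = (p \cap b)) = \frac{\epsilon_2 (1 - \epsilon_1)}{K_{12}} \qquad (32)$$

$$\mathrm{Pl}(T = \bar{f}|T = (p \cap b)) = \frac{\epsilon_2 (1 - \epsilon_1)}{K_{12}} + \frac{\epsilon_1 \epsilon_2}{K_{12}} = \frac{\epsilon_2}{K_{12}} \qquad (33)$$

Using the first-order approximation when $\epsilon_1$ and $\epsilon_2$ are very small positive numbers, one finally gets

$$\mathrm{Bel}(T = f|T = (p \cap b)) \approx \mathrm{Pl}(T = f|T = (p \cap b)) \approx \frac{\epsilon_1}{\epsilon_1 + \epsilon_2}$$

In a similar way, one gets for the belief that Tweety cannot fly

$$\mathrm{Bel}(T = \bar{f}|T = (p \cap b)) \approx \mathrm{Pl}(T = \bar{f}|T = (p \cap b)) \approx \frac{\epsilon_2}{\epsilon_1 + \epsilon_2}$$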



This result coincides with Judea Pearl's result, but a different and more detailed analysis has been presented here. It turns out that this simple and complete analysis actually corresponds to the ballooning extension and the generalized Bayesian theorem proposed by Smets in [22, 25] and discussed by Shafer in GU, although it was carried out independently of Smets' works. As pointed out by Judea Pearl, this result based on DST and Dempster's rule of combination looks very paradoxical/counter-intuitive since it means that if nonflying birds are very rare, i.e. $\epsilon_2 \approx 0$, then penguin-birds, like our observed penguin-bird Tweety, have a very big chance of flying. As stated by Judea Pearl in [12], pages 448-449: "The clash with intuition revolves not around the exact numerical value of Bel(f) but rather around the unacceptable phenomenon that rule $r_3$, stating that penguins are a subclass of birds, plays no role in the analysis. Knowing that Tweety is both a penguin and a bird renders $\mathrm{Bel}(T = f \mid T = (p \cap b))$ solely a function of $m_1(.)$ and $m_2(.)$, regardless of how penguins and birds are related. This stands contrary to common discourse, where people expect class properties to be overridden by properties of more specific subclasses. While in classical logic the three rules in our example would yield an unforgivable contradiction, the uncertainties attached to these rules, together with Dempster's normalization, now render them manageable. However, they are managed in the wrong way whenever we interpret if-then rules as randomized logical formulas of the material-implication type, instead of statements of conditional probabilities". Keep in mind, however, that Pearl makes this statement to expose the semantic clash between the Dempster-Shafer reasoning and the fallacious Bayesian reasoning, in order to support the Bayesian reasoning approach.
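The behaviour criticized by Pearl can be checked numerically. The following Python sketch only evaluates the closed-form belief and plausibility obtained above with Dempster's rule. The function name `dst_fly_interval` and the sample values of the two parameters are illustrative assumptions, and $K_{12}$ is taken to be $\epsilon_1 + \epsilon_2 - \epsilon_1\epsilon_2$, the value that makes the three masses of $m_{o123}(.)$ sum to one.

```python
def dst_fly_interval(eps1, eps2):
    """Return (Bel, Pl) that Tweety can fly under Dempster's rule."""
    # Normalization constant, assumed equal to 1 minus the conflict (1-eps1)(1-eps2).
    k12 = eps1 + eps2 - eps1 * eps2
    bel = eps1 * (1 - eps2) / k12
    pl = eps1 / k12
    return bel, pl


# Illustrative values only: non-flying birds become rarer and rarer.
for eps1, eps2 in [(0.01, 0.01), (0.01, 0.001), (0.01, 0.0001)]:
    bel, pl = dst_fly_interval(eps1, eps2)
    approx = eps1 / (eps1 + eps2)  # first-order approximation
    print(f"eps1={eps1}, eps2={eps2}: Bel={bel:.4f}, Pl={pl:.4f}, approx={approx:.4f}")
```

As the second parameter shrinks relative to the first, both values move towards 1, which is the counter-intuitive conclusion discussed above.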



5 The Dezert-Smarandache reasoning 

Before going further in our analysis, some clarification is necessary to explain to the reader the fundamental difference between the foundations of DSmT and those of DST. DSmT can be viewed as a general and flexible Bottom-Up approach for managing uncertainty and conflicts in fusion problems. It arises from the fact that the conflict between sources of evidence can come not only from the reliability of the sources themselves (which can be handled quite easily by classical discounting methods) but also from different interpretations of the elements of the frame, because the sources of evidence have only limited knowledge and provide their beliefs only with respect to that knowledge, usually based on their own (local) experience. Moreover, the elements of the frame of the problem may not be refinable at all in some cases involving vague concepts like smallness/tallness, pleasure/pain, etc., because of the continuous path from one to the other. Based on this matter of fact, DSmT proposes a new mathematical framework which starts at the bottom level (solid ground level) from the free DSm model and the notion of hyper-power set (Dedekind's lattice), then provides a general rule of combination to work with the free DSm model. It then makes it possible to take any kind of integrity constraint into account in the free DSm model, if necessary, through the hybrid DSm rule of combination. Taking an integrity constraint into account consists just in forcing some elements of Dedekind's lattice to be empty, just because they truly are for some given problems.






The introduction of an integrity constraint is like "pushing an elevator button" for going a bit up in the process of managing uncertainty and conflicts. If one needs to go higher, then one can take several integrity constraints into account as well in the framework of DSmT. If we finally want to take into account all possible exclusivity constraints, because we know that all elements of the frame of the problem under consideration are truly exclusive, then we go directly to the Top level (Shafer's model, which serves as the foundation for DST).

DSmT, however, can handle not only exclusivity constraints but also existential constraints or mixed constraints, which is helpful for some dynamic fusion problems. It is also important to emphasize that the hybrid DSm rule of combination is definitely not equivalent to Dempster's rule of combination (and its alternatives based on the Top level) because one can stop and work at any level in the process of managing uncertainty and conflicts, depending on the nature of the problem. The hybrid DSm rule and Dempster's rule do not provide the same results even when working with Shafer's model, as will be proved in the sequel. The approach proposed by DSmT to attack the fusion problem is totally new both in its foundations and in the solution provided.

DSmT was originally (ground level) developed for the fusion of uncertain and paradoxical (highly conflicting) sources of information (bodies of evidence) based on the free DSm model, which assumes that none of the elements of the frame $\Theta$ are exclusive. This model is opposite to Shafer's model. Let us consider a free DSm model $\mathcal{M}^f(\Theta)$ with $\Theta = \{\theta_1, \ldots, \theta_n\}$. DSmT starts with the notion of hyper-power set $D^\Theta$, defined as the set of all composite propositions built from elements of $\Theta$ with the $\cup$ and $\cap$ operators ($\Theta$ generates $D^\Theta$ under operators $\cup$ and $\cap$) such that:

1. $\emptyset, \theta_1, \ldots, \theta_n \in D^\Theta$.

2. If $A, B \in D^\Theta$, then $A \cap B \in D^\Theta$ and $A \cup B \in D^\Theta$.

3. No other elements belong to $D^\Theta$, except those obtained by using rules 1 or 2.

The cardinality of the hyper-power set, $d(n) \triangleq |D^\Theta|$ for $n \geq 1$, follows the sequence of Dedekind's numbers 1, 2, 5, 19, 167, 7580, 7828353, ... More details about the generation and partial ordering of the elements of the hyper-power set can be found in HE] ED. From this model, the authors have proposed a new, simple, associative and commutative rule of combination (the classic DSm rule) and then extended this rule to deal with any kind of hybrid model, i.e. sets $\Theta$ for which some propositions/elements of $D^\Theta$ are known or forced to be empty depending on the nature and the dynamicity of the fusion problem under consideration. In this framework, Shafer's model appears only as a special hybrid model (the most constrained one, if we don't introduce existential constraints). The hybrid DSm fusion rule covers a wide class of fusion applications but is restricted to the fusion of precise uncertain and paradoxical information only ED. We have recently extended this rule with new set operators for the fusion of imprecise, uncertain and paradoxical information; see ED for details.
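The three generation rules and the quoted cardinalities can be illustrated by brute force. The Python sketch below is not the authors' generation algorithm: it assumes a Venn-diagram encoding in which each proposition is a bitmask over the $2^n - 1$ regions of the free model, and the name `hyper_power_set_size` is hypothetical. It closes the generators under union and intersection and counts the result. The leading 1 of the sequence is assumed to correspond to the degenerate case $n = 0$.

```python
def hyper_power_set_size(n):
    """Count the elements of D^Theta for a free DSm model with n generators."""
    # Each Venn region is encoded by a non-zero bitmask r: bit i of r is set
    # when the region lies inside theta_i. A proposition is a bitmask over regions.
    regions = range(1, 2 ** n)
    generators = [sum(1 << r for r in regions if (r >> i) & 1) for i in range(n)]
    elements = {0, *generators}  # 0 encodes the empty set (rule 1)
    frontier = set(generators)
    while frontier:  # rule 2: close under union and intersection
        new = set()
        for a in frontier:
            for b in elements:
                for c in (a & b, a | b):
                    if c not in elements:
                        new.add(c)
        elements |= new
        frontier = new
    return len(elements)


print([hyper_power_set_size(n) for n in range(1, 5)])
```

For $n = 1, \ldots, 4$ this prints `[2, 5, 19, 167]`. The approach is only practical for very small frames, since the size of $D^\Theta$ grows extremely fast with $n$.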

We analyze here the Tweety Penguin Triangle Problem with DSmT. The prior knowledge characterized by the rules $R = \{r_1, r_2, r_3\}$ and convictions $W = \{w_1, w_2, w_3\}$ is modeled as three independent sources of evidence defined on separate minimal and potentially paradoxical (i.e. internally conflicting) frames $\Theta_1 = \{p, \bar{f}\}$, $\Theta_2 = \{b, f\}$ and $\Theta_3 = \{p, b\}$, since the rule $r_1$ doesn't refer to the existence of $b$, the rule $r_2$ doesn't refer to the existence of $p$ and the rule $r_3$ doesn't refer to the existence of $f$ or $\bar{f}$. Note that DSmT doesn't require the refinement of frames as DST does (see previous section). We follow the same analysis as in the previous section, but now based on our DSm reasoning and the DSm rule of combination.

The first source $\mathcal{B}_1$ relative to $r_1$ with confidence $w_1 = 1 - \epsilon_1$ provides us the conditional belief $\mathrm{Bel}_1(\bar{f} \mid p)$, which is now defined from a paradoxical basic belief assignment $m_1(.)$ resulting from the DSm combination of $m_1''(p) = 1$ with $m_1'(.)$ defined on the hyper-power set $D^{\Theta_1} = \{\emptyset, p, \bar{f}, p \cap \bar{f}, p \cup \bar{f}\}$. The choice for $m_1'(.)$ results directly from the derivation of the DSm rule and the application of the MCP. Indeed, the non null components of $m_1(.)$ are given by (we introduce explicitly the conditioning term in notation for convenience):

$$m_1(p \mid p) = \underbrace{m_1''(p)}_{1}\, m_1'(p) + \underbrace{m_1''(p)}_{1}\, m_1'(p \cup \bar{f})$$

$$m_1(p \cap \bar{f} \mid p) = \underbrace{m_1''(p)}_{1}\, m_1'(\bar{f}) + \underbrace{m_1''(p)}_{1}\, m_1'(p \cap \bar{f})$$

The information $\mathrm{Bel}_1(\bar{f} \mid p) = 1 - \epsilon_1$ implies

$$\mathrm{Bel}_1(\bar{f} \mid p) = m_1(\bar{f} \mid p) + m_1(p \cap \bar{f} \mid p) = 1 - \epsilon_1$$






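Since $m_1(p \mid p) + m_1(p \cap \bar{f} \mid p) = 1$, one has necessarily $m_1(\bar{f} \mid p) = 0$ and thus from the previous equation $m_1(\bar{f} \cap p \mid p) = 1 - \epsilon_1$, which implies both

$$m_1(p \mid p) = \epsilon_1$$

$$m_1(p \cap \bar{f} \mid p) = \underbrace{m_1''(p)}_{1}\, m_1'(\bar{f}) + \underbrace{m_1''(p)}_{1}\, m_1'(p \cap \bar{f}) = m_1'(\bar{f}) + m_1'(p \cap \bar{f}) = 1 - \epsilon_1$$

Applying the MCP, it results that one must choose

$$m_1'(\bar{f}) = 1 - \epsilon_1 \quad \text{and} \quad m_1'(p \cap \bar{f}) = 0$$

The sum of the remaining masses of $m_1'(.)$ must then be equal to $\epsilon_1$, i.e.

$$m_1'(p) + m_1'(p \cup \bar{f}) = \epsilon_1$$

Applying again the MCP on this last constraint, one gets naturally

$$m_1'(p) = 0 \quad \text{and} \quad m_1'(p \cup \bar{f}) = \epsilon_1$$

Finally the belief assignment $m_1(. \mid p)$ relative to the source $\mathcal{B}_1$ and compatible with the constraint $\mathrm{Bel}_1(\bar{f} \mid p) = 1 - \epsilon_1$ holds the same numerical values as within the DST analysis (see eqs. (20)-(21)) and is given by

$$m_1(p \cap \bar{f} \mid p) = 1 - \epsilon_1$$

$$m_1(p \mid p) = \epsilon_1$$

but results here from the DSm combination of the two following assignments (i.e. $m_1(.) = [m_1' \oplus m_1''](.) = [m_1'' \oplus m_1'](.)$)

$$\begin{cases} m_1'(\bar{f}) = 1 - \epsilon_1 \ \text{and} \ m_1'(p \cup \bar{f}) = \epsilon_1 \\ m_1''(p) = 1 \end{cases} \tag{34}$$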



In a similar manner, and working on $\Theta_2 = \{b, f\}$ for source $\mathcal{B}_2$ with the condition $\mathrm{Bel}_2(f \mid b) = 1 - \epsilon_2$, the mass $m_2(. \mid b)$ results from the internal DSm combination of the two following assignments

$$\begin{cases} m_2'(f) = 1 - \epsilon_2 \ \text{and} \ m_2'(b \cup f) = \epsilon_2 \\ m_2''(b) = 1 \end{cases} \tag{35}$$

Similarly, and working on $\Theta_3 = \{p, b\}$ for source $\mathcal{B}_3$ with the condition $\mathrm{Bel}_3(b \mid p) = 1 - \epsilon_3$, the mass $m_3(. \mid p)$ results from the internal DSm combination of the two following assignments

$$\begin{cases} m_3'(b) = 1 - \epsilon_3 \ \text{and} \ m_3'(b \cup p) = \epsilon_3 \\ m_3''(p) = 1 \end{cases} \tag{36}$$

It can be easily verified that these (less specific) basic belief assignments generate the conditions $\mathrm{Bel}_1(\bar{f} \mid p) = 1 - \epsilon_1$, $\mathrm{Bel}_2(f \mid b) = 1 - \epsilon_2$ and $\mathrm{Bel}_3(b \mid p) = 1 - \epsilon_3$.
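The verification mentioned above can be sketched for the first source. The Python fragment below assumes a Venn-diagram encoding of the free model on $\Theta_1$ (three regions, with hypothetical labels `p_only`, `nf_only` and `p_and_nf`, where "nf" stands for $\bar{f}$) and an illustrative value for $\epsilon_1$. The helper `classic_dsm` is a minimal two-source version of the classic DSm rule, in which the product of masses is assigned to the intersection of the two propositions.

```python
from itertools import product

# Venn regions of the free model on Theta_1 = {p, nf}; labels are hypothetical.
P = frozenset({"p_only", "p_and_nf"})
NF = frozenset({"nf_only", "p_and_nf"})


def classic_dsm(m_a, m_b):
    """Classic DSm rule for two sources: product mass goes to the intersection."""
    fused = {}
    for (x, mass_x), (y, mass_y) in product(m_a.items(), m_b.items()):
        fused[x & y] = fused.get(x & y, 0.0) + mass_x * mass_y
    return fused


eps1 = 0.05  # illustrative value, not taken from the paper
m1_prime = {NF: 1 - eps1, P | NF: eps1}   # m_1'(.)
m1_second = {P: 1.0}                      # m_1''(.)
m1 = classic_dsm(m1_prime, m1_second)

# Belief in nf given p: sum of the masses of all propositions included in nf.
bel1 = sum(mass for x, mass in m1.items() if x <= NF)
print(m1[P & NF], m1[P], bel1)
```

The fused assignment carries mass $1 - \epsilon_1$ on $p \cap \bar{f}$ and $\epsilon_1$ on $p$, so the recovered belief equals $1 - \epsilon_1$. The other two sources can be checked in the same way by renaming the regions.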

Now let's examine the result of the fusion of all these masses based on DSmT, i.e. by applying the DSm rule of combination to the following basic belief assignments

$$m_1(p \cap \bar{f} \mid p) = 1 - \epsilon_1 \quad \text{and} \quad m_1(p \mid p) = \epsilon_1$$

$$m_2(b \cap f \mid b) = 1 - \epsilon_2 \quad \text{and} \quad m_2(b \mid b) = \epsilon_2$$

$$m_3(p \cap b \mid p) = 1 - \epsilon_3 \quad \text{and} \quad m_3(p \mid p) = \epsilon_3$$

Note that these basic belief assignments turn out to be identical to those drawn from the DST framework analysis done in the previous section for this specific problem, because of the integrity constraint $f \cap \bar{f} = \emptyset$ and the MCP, but they actually result from a slightly different and simpler analysis drawn here from DSmT. So we attack the TP2 with the same information as in the analysis based on DST, but we will show that a coherent conclusion can be drawn with DSm reasoning.



Let's emphasize now that one has to deal here with the hypotheses/elements $p$, $b$, $f$ and $\bar{f}$ and thus our global frame is given by $\Theta = \{b, p, f, \bar{f}\}$. Note that $\Theta$ doesn't satisfy Shafer's model since the elements of $\Theta$ are not all exclusive. This is a major difference between the foundations of DSmT and the foundations of DST. But because only $f$ and $\bar{f}$ are truly exclusive, i.e. $f \cap \bar{f} = \emptyset$, we face a simple hybrid DSm model $\mathcal{M}$ and thus the hybrid DSm fusion rule must apply rather than the classic DSm rule. We briefly recall here (a complete derivation, justification and examples can be found in lED) that the hybrid DSm rule of combination associated with a given hybrid DSm model for $k \geq 2$ independent sources of information is defined for all $A \in D^\Theta$ as:

$$m_{\mathcal{M}(\Theta)}(A) \triangleq \phi(A)\Big[S_1(A) + S_2(A) + S_3(A)\Big] \tag{37}$$

where $\phi(A)$ is the characteristic emptiness function of the set $A$, i.e. $\phi(A) = 1$ if $A \notin \boldsymbol{\emptyset}$ ($\boldsymbol{\emptyset} \triangleq \{\emptyset_{\mathcal{M}}, \emptyset\}$ being the set of all relatively and absolutely empty elements) and $\phi(A) = 0$ otherwise, and

$$S_1(A) \triangleq \sum_{\substack{X_1, X_2, \ldots, X_k \in D^\Theta \\ (X_1 \cap X_2 \cap \ldots \cap X_k) = A}} \; \prod_{i=1}^{k} m_i(X_i) \tag{38}$$

$$S_2(A) \triangleq \sum_{\substack{X_1, X_2, \ldots, X_k \in \boldsymbol{\emptyset} \\ [\mathcal{U} = A] \vee [(\mathcal{U} \in \boldsymbol{\emptyset}) \wedge (A = I_t)]}} \; \prod_{i=1}^{k} m_i(X_i) \tag{39}$$

$$S_3(A) \triangleq \sum_{\substack{X_1, X_2, \ldots, X_k \in D^\Theta \\ (X_1 \cup X_2 \cup \ldots \cup X_k) = A \\ (X_1 \cap X_2 \cap \ldots \cap X_k) \in \boldsymbol{\emptyset}}} \; \prod_{i=1}^{k} m_i(X_i) \tag{40}$$

with $\mathcal{U} \triangleq u(X_1) \cup u(X_2) \cup \ldots \cup u(X_k)$, where $u(X)$ is the union of all singletons $\theta_i$ that compose $X$, and $I_t \triangleq \theta_1 \cup \theta_2 \cup \ldots \cup \theta_n$ is the total ignorance defined on the frame $\Theta = \{\theta_1, \ldots, \theta_n\}$. For example, if $X$ is a singleton then $u(X) = X$; if $X = \theta_1 \cap \theta_2$ or $X = \theta_1 \cup \theta_2$ then $u(X) = \theta_1 \cup \theta_2$; if $X = (\theta_1 \cap \theta_2) \cup \theta_3$ then $u(X) = \theta_1 \cup \theta_2 \cup \theta_3$; by convention $u(\emptyset) \triangleq \emptyset$.



The first sum $S_1(A)$ entering in the previous formula corresponds to the mass $m_{\mathcal{M}^f(\Theta)}(A)$ obtained by the classic DSm rule of combination based on the free DSm model $\mathcal{M}^f$ (i.e. on the free lattice $D^\Theta$). The second sum $S_2(A)$ entering in the formula of the hybrid DSm rule of combination (37) represents the mass of all relatively and absolutely empty sets which is transferred to the total or relative ignorances. The third sum $S_3(A)$ entering in the formula of the hybrid DSm rule of combination (37) transfers the sum of relatively empty sets to the non-empty sets in the same way as it was calculated following the classic DSm rule.

To apply the hybrid DSm fusion rule formula (37), it is important to note that $(p \cap \bar{f}) \cap (b \cap f) \cap p = p \cap b \cap f \cap \bar{f} = \emptyset$ because $f \cap \bar{f} = \emptyset$; thus the mass $(1 - \epsilon_1)(1 - \epsilon_2)\epsilon_3$ is transferred to the hybrid proposition $H_1 \triangleq (p \cap \bar{f}) \cup (b \cap f) \cup p = (b \cap f) \cup p$. Similarly, $(p \cap \bar{f}) \cap (b \cap f) \cap (p \cap b) = p \cap b \cap f \cap \bar{f} = \emptyset$ because $f \cap \bar{f} = \emptyset$, and therefore its associated mass $(1 - \epsilon_1)(1 - \epsilon_2)(1 - \epsilon_3)$ is transferred to the hybrid proposition $H_2 \triangleq (p \cap \bar{f}) \cup (b \cap f) \cup (p \cap b)$. No other mass transfer is necessary for this Tweety Penguin Triangle Problem and thus we finally get from the hybrid DSm fusion formula (37) the following result for $m_{123}(. \mid p \cap b) = [m_1 \oplus m_2 \oplus m_3](.)$ (where the $\oplus$ symbol corresponds here to the DSm fusion operator):

$$m_{123}(H_1 \mid p \cap b) = (1 - \epsilon_1)(1 - \epsilon_2)\epsilon_3$$

$$m_{123}(H_2 \mid p \cap b) = (1 - \epsilon_1)(1 - \epsilon_2)(1 - \epsilon_3)$$

$$m_{123}(p \cap b \cap \bar{f} \mid p \cap b) = (1 - \epsilon_1)\epsilon_2\epsilon_3 + (1 - \epsilon_1)\epsilon_2(1 - \epsilon_3)$$

$$m_{123}(p \cap b \cap f \mid p \cap b) = \epsilon_1(1 - \epsilon_2)\epsilon_3 + \epsilon_1(1 - \epsilon_2)(1 - \epsilon_3)$$

$$m_{123}(p \cap b \mid p \cap b) = \epsilon_1\epsilon_2\epsilon_3 + \epsilon_1\epsilon_2(1 - \epsilon_3)$$

with

$$H_1 \triangleq (b \cap f) \cup p$$

$$H_2 \triangleq (p \cap \bar{f}) \cup (b \cap f) \cup (p \cap b)$$

It can be easily checked that these masses sum up to 1. After elementary algebraic simplifications, one finally gets for the DSm fusion of all available prior information, reintroducing explicitly the conditioning term,

$$m_{123}(H_1 \mid p \cap b) = (1 - \epsilon_1)(1 - \epsilon_2)\epsilon_3$$

$$m_{123}(H_2 \mid p \cap b) = (1 - \epsilon_1)(1 - \epsilon_2)(1 - \epsilon_3)$$

$$m_{123}(p \cap b \cap \bar{f} \mid p \cap b) = (1 - \epsilon_1)\epsilon_2$$

$$m_{123}(p \cap b \cap f \mid p \cap b) = \epsilon_1(1 - \epsilon_2)$$

$$m_{123}(p \cap b \mid p \cap b) = \epsilon_1\epsilon_2$$

We can check that all these masses add up to 1 and that this result is fully coherent with rational intuition, especially when $\epsilon_3 = 0$, because the non null components of $m_{123}(. \mid p \cap b)$ reduce to

$$m_{123}(H_2 \mid p \cap b) = (1 - \epsilon_1)(1 - \epsilon_2)$$

$$m_{123}(p \cap b \cap \bar{f} \mid p \cap b) = (1 - \epsilon_1)\epsilon_2$$

$$m_{123}(p \cap b \cap f \mid p \cap b) = \epsilon_1(1 - \epsilon_2)$$

$$m_{123}(p \cap b \mid p \cap b) = \epsilon_1\epsilon_2$$

which means that from our DSm reasoning there is a strong uncertainty (due to the conflicting rules of our rule-based system), when $\epsilon_1$ and $\epsilon_2$ remain small positive numbers, that a penguin-bird animal is either a penguin-nonflying animal or a bird-flying animal. The small value $\epsilon_1\epsilon_2$ for $m_{123}(p \cap b \mid p \cap b)$ expresses adequately the fact that we cannot commit a strong basic belief assignment only to $p \cap b$ knowing $p \cap b$, just because one works on $\Theta = \{p, b, f, \bar{f}\}$ and we cannot consider the property $p \cap b$ solely, because the "birdness" or "penguinness" property necessarily endows either the flying or the non-flying property.
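The fused masses above can be reproduced with a short computation. The following Python sketch is a simplified instance of the hybrid DSm rule and not a general implementation: it assumes a Venn-diagram encoding of the hybrid model (regions containing both $f$ and $\bar{f}$ are removed), implements only the sums $S_1$ and $S_3$ (no combination in this problem has an empty union, so $S_2$ is left out), and uses illustrative numerical values. The names `prop`, `hybrid_dsm` and the label `nf` for $\bar{f}$ are hypothetical.

```python
from itertools import combinations, product

GENERATORS = ("p", "b", "f", "nf")  # "nf" is a hypothetical label for non-flying

# Atomic Venn regions of the hybrid model: every non-empty set of generators
# except those containing both f and nf (integrity constraint: f and nf disjoint).
REGIONS = [
    frozenset(c)
    for size in range(1, len(GENERATORS) + 1)
    for c in combinations(GENERATORS, size)
    if not {"f", "nf"} <= set(c)
]


def prop(name):
    """Proposition for one generator: the set of regions lying inside it."""
    return frozenset(r for r in REGIONS if name in r)


def hybrid_dsm(sources):
    """Fuse bbas {proposition: mass}. Non-empty intersections keep their mass
    (S1); empty intersections send it to the union of the operands (S3)."""
    fused = {}
    for combo in product(*(s.items() for s in sources)):
        sets = [x for x, _ in combo]
        mass = 1.0
        for _, m in combo:
            mass *= m
        inter = frozenset.intersection(*sets)
        target = inter if inter else frozenset.union(*sets)
        fused[target] = fused.get(target, 0.0) + mass
    return fused


P, B, F, NF = (prop(g) for g in GENERATORS)
e1, e2, e3 = 0.1, 0.2, 0.3  # illustrative values, not taken from the paper

m1 = {P & NF: 1 - e1, P: e1}
m2 = {B & F: 1 - e2, B: e2}
m3 = {P & B: 1 - e3, P: e3}
m123 = hybrid_dsm([m1, m2, m3])

H1 = (B & F) | P
H2 = (P & NF) | (B & F) | (P & B)
for label, x in [("H1", H1), ("H2", H2), ("p&b&nf", P & B & NF),
                 ("p&b&f", P & B & F), ("p&b", P & B)]:
    print(label, round(m123[x], 6))
```

With these values the five focal elements receive the masses given by the closed forms above, and they sum to one. Representing propositions as sets of regions makes the emptiness test a plain set operation, at the cost of enumerating all regions of the frame.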

Therefore the belief that the particular observed penguin-bird animal Tweety (corresponding to the particular mass $m_o(T = (p \cap b)) = 1$) can fly can be easily derived from the DSm fusion of all our prior knowledge summarized by $m_{123}(. \mid p \cap b)$ and the available observation summarized by $m_o(.)$, and we get

$$m_{o123}(T = (p \cap b \cap \bar{f}) \mid T = (p \cap b)) = (1 - \epsilon_1)\epsilon_2$$

$$m_{o123}(T = (p \cap b \cap f) \mid T = (p \cap b)) = \epsilon_1(1 - \epsilon_2)$$

$$m_{o123}(T = (p \cap b) \mid T = (p \cap b)) = \epsilon_1\epsilon_2$$

$$m_{o123}(T = H_1 \mid T = (p \cap b)) = (1 - \epsilon_1)(1 - \epsilon_2)\epsilon_3$$

$$m_{o123}(T = H_2 \mid T = (p \cap b)) = (1 - \epsilon_1)(1 - \epsilon_2)(1 - \epsilon_3)$$

From the DSm reasoning, the belief that Tweety can fly is then given by

$$\mathrm{Bel}(T = f \mid T = (p \cap b)) = \sum_{x \in D^\Theta,\; x \subseteq f} m_{o123}(T = x \mid T = (p \cap b))$$

Using all the components of $m_{o123}(. \mid T = (p \cap b))$, one directly gets

$$\mathrm{Bel}(T = f \mid T = (p \cap b)) = m_{o123}(T = (f \cap b \cap p) \mid T = (p \cap b))$$

and finally

$$\mathrm{Bel}(T = f \mid T = (p \cap b)) = \epsilon_1(1 - \epsilon_2) \tag{41}$$

In a similar way, one will get for the belief that Tweety cannot fly

$$\mathrm{Bel}(T = \bar{f} \mid T = (p \cap b)) = \epsilon_2(1 - \epsilon_1) \tag{42}$$



So now for both cases the beliefs remain very low, which is normal and coherent with the analysis done in section 3.2. Now let's examine the plausibilities of the ability for Tweety to fly or not to fly. These are given by

$$\mathrm{Pl}(T = f \mid T = (p \cap b)) \triangleq \sum_{x \in D^\Theta,\; x \cap f \neq \emptyset} m_{o123}(T = x \mid T = (p \cap b))$$

$$\mathrm{Pl}(T = \bar{f} \mid T = (p \cap b)) \triangleq \sum_{x \in D^\Theta,\; x \cap \bar{f} \neq \emptyset} m_{o123}(T = x \mid T = (p \cap b))$$

which turn out to be, after elementary algebraic manipulations,

$$\mathrm{Pl}(T = f \mid T = (p \cap b)) = (1 - \epsilon_2) \tag{43}$$

$$\mathrm{Pl}(T = \bar{f} \mid T = (p \cap b)) = (1 - \epsilon_1) \tag{44}$$

So we conclude, as expected, that we can't decide on the ability for Tweety of flying or of not flying, since one has

$$[\mathrm{Bel}(f \mid p \cap b), \mathrm{Pl}(f \mid p \cap b)] = [\epsilon_1(1 - \epsilon_2), (1 - \epsilon_2)] \approx [0, 1]$$

$$[\mathrm{Bel}(\bar{f} \mid p \cap b), \mathrm{Pl}(\bar{f} \mid p \cap b)] = [\epsilon_2(1 - \epsilon_1), (1 - \epsilon_1)] \approx [0, 1]$$

Note that when setting $\epsilon_1 = 0$ and $\epsilon_2 = 1$ (or $\epsilon_1 = 1$ and $\epsilon_2 = 0$), i.e. one forces the full consistency of the initial rule-based system, one gets a coherent result on the certainty of the ability of Tweety to not fly (or to fly respectively).
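The two belief intervals and their limiting cases can be evaluated directly. The Python sketch below takes the expressions (41)-(44) exactly as stated above; the function name `dsm_intervals` and the sample parameter values are illustrative assumptions.

```python
def dsm_intervals(eps1, eps2):
    """[Bel, Pl] for 'fly' and 'not fly', using (41)-(44) as stated in the text."""
    fly = (eps1 * (1 - eps2), 1 - eps2)
    not_fly = (eps2 * (1 - eps1), 1 - eps1)
    return fly, not_fly


# Small illustrative uncertainties: both intervals are close to [0, 1].
print(dsm_intervals(0.01, 0.01))

# Fully consistent rule base: Tweety certainly cannot fly, or certainly can.
print(dsm_intervals(0.0, 1.0))
print(dsm_intervals(1.0, 0.0))
```

For small parameters both intervals nearly cover $[0, 1]$, which is the undecidable situation described above, while the two limiting settings collapse the intervals to single points.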

This coherent result (radically different from the one based on Dempster-Shafer reasoning but starting with exactly the same available information) comes from the hybrid DSm fusion rule, which transfers some parts of the mass of the empty set $m(\emptyset) = (1 - \epsilon_1)(1 - \epsilon_2)\epsilon_3 + (1 - \epsilon_1)(1 - \epsilon_2)(1 - \epsilon_3) \approx 1$ onto propositions $H_1$ and $H_2$. It is clear however that the high value of $m(\emptyset)$ in this TP2 indicates a highly conflicting fusion problem, which shows that the TP2 is truly an almost impossible problem, and the fusion result based on DSmT reasoning allows us to conclude on the true undecidability of the ability of Tweety to fly or not to fly. In other words, the fusion based on DSmT can be applied adequately to this almost impossible problem and concludes correctly on its undecidability. Another simplistic solution would consist in saying naturally that the problem has to be considered as an impossible one just because $m(\emptyset) > 0.5$.

6 Conclusion 

In this paper we have proposed a deep analysis of the challenging Tweety Penguin Triangle Problem. The analysis proves that the Bayesian reasoning cannot be mathematically justified to characterize the problem because the probabilistic model doesn't hold, even when accepting the principle of indifference and the conditional independence assumption. Any conclusions drawn from such a representation of the problem, based on a hypothetical probabilistic model, are actually based on a fallacious Bayesian reasoning. This is a fundamental result. We have then shown how the Dempster-Shafer reasoning manages, in what we feel is a wrong way, the uncertainty and the conflict in this problem. We then proved that DSmT can deal properly with this problem and provides a well-founded and reasonable conclusion about the undecidability of its solution.
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